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'S ■ Abstract 



This paper considers the empirical hkelihood (EL) construction of confidence inter- 
vals for a hnear functional 6 based on right censored hfetime data. Many of the results 
in literature show that —2 log (empirical likelihood ratio) has a limiting scaled-Xi dis- 
- tribution, where the scale parameter is a function of the unknown asymptotic variance. 

b:^ ! The scale parameter has to be estimated for the construction. Additional estimation 

would reduce the coverage accuracy for 6. This diminishes a main advantage of the 
"k>i ' EL method for censored data. By utilizing certain influence functions in an estimating 

c^ . equation, it is shown that under very general conditions, —2 log(EL ratio) converges 

weakly to a standard xl distribution and thereby eliminates the need for estimating 
the scale parameter. Moreover, a special way of employing influence functions eases 
the otherwise very demanding computations of the EL method. Our approach yields 
smaller asymptotic variance of the influence function than those comparable ones con- 
sidered by Wang & Jing (2001) and Qin & Zhao (2007). Thus it is not surprising 
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that confidence intervals using influence functions give a better coverage accuracy as 
demonstrated by simulations. 
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1 Introduction 

Let y be a non-negative random variable with distribution function F{y) = P[Y < y]. 
Let C be a nonnegative random variable with distribution function G{x) = P[C < x] 
and C is independent of Y. Instead of Y, we observe Z = min(y, C) and the indicator 
6 = I[Y < C] oi the event [Y < C]. Both F and G are assumed continuous but 
unknown. In this paper we use the empirical likelihood method with right-censored 
data to study the problem of constructing confldence intervals for 6, a functional of the 
distribution F deflned by Eg{Y,6) = 0. For example, if g{x,6) = ^{x) — 6, for some 
function ^ we get 

/■oo 

e = EaY)= / ax)dFix). (1. 1) 

The problem of estimating 6 in (1.1) with a sample of n i.i.d. observations of {Z, S) 
has been studied by many authors. If ^{Y) = I[Y < y] for a flxed y, then 6 = F{y). A 
well-known nonparametric maximum likelihood and asymptotically optimal estimator 
of F{y) is the Kaplan-Meier (KM) estimator F„ as deflned in (2.3). A natural estimator 
of ^ is 

POD 

en= / e(s)dF„(s). (1. 2) 







For an arbitrary function ^, several authors, e.g. Yang (1994) have shown that under 
the condition of flnite second moment, 



oo c2i 



iMdF(s)<oo, (1. 3) 

G{s) ^ ^ ' ^ ^ 



the asymptotic distribution of \fn(Qn ^ &)■, as n goes to infinity, is normal A^(0, a 
wliere 

\2 



2\ 



a- = r (-^(1^(")_^(")) dF(s), ^{s) = r C{x) dF{x), s > 0, (1. 4) 

Jo F (s)G(s) Js 



F\s)G{s 
andG= l-G,F= l-F. 



Confidence intervals for 9 can be constructed using the asymptotic normal distribu- 
tion A^(0, 0"^). Alternatively, the EL method can be used as to be investigated in this 
paper. Employing either method, one needs to deal with a rather complicated form of 
the asymptotic variance o"^. Among other things, it is computationally demanding. 

To use the normal distribution A^(0, cr^), it is necessary to estimate the unknown 
variance o"^. Stute (1996) proposed a jackknife estimator to replace a^ in the calcula- 
tion. Although any consistent estimator a^ of o"^ can be used, the convergence rate of 
0"^ is generally unknown. Substitution by the estimate a^ tends to reduce the coverage 
accuracy for 6 as compare to the case of known cr^. 

The usefulness of the EL method for constructing confidence interval/regions has 
been well established in a wide variety of situations, see e.g., DiCiccio et al. (1991) and 
Chen (1994), and an extensive literature review in Owen (2001) to that day. Let R{0) 
denote the EL ratio function of a one-dimensional parameter 6 for n i.i.d "complete" 
observations. Owen (1988) proved that under certain regularity conditions, —2 log-R(^) 
converges to a chi-squared distribution with one degree of freedom. The EL method 
gives confidence intervals for ^ as {^ : —2 log i?(6') < Ci_q,}, where Ci_a is the (1 — Q;)th 
quantile of the Xi distribution. Here the construction of confidence intervals does not 
require estimation of asymptotic variance. In view of a complicated variance formula in 
(1.4), this would have provided a welcome method for censored data. However, as far 
as we know, for censored data, the asymptotic standard chi-squared distribution holds 
only in some special cases see, e.g. Owen (Chapter 6, 2001). More recent literature 
shows that most of the asymptotic distributions involve weights which are functions 
of unknown variances or covariance matrices. This is the case, for example, in the 



following papers. Li and Wang (2003) studied right-censored regression models, Ren 
(2008) used weighted EL under a variety of censoring models, Wang & Jing (2001) and 
Qin & Zhao (2007) estimated functionals ^, and Hjort& McKeague & van Keilegom, 
in their extension of the scope of the EL method (2009), obtained an asymptotic 
distribution (Theorem 2.1) which is a sum of weighted chi-squared distributions with 
unknown weights. Therefore, using these results to construct confidence intervals for 
9 still require an additional estimation of the unknown cr^. This diminishes a main 
advantage of the EL method for censored data. 

The EL ratio R{0) is obtained by utilizing auxiliary information on 9 through a set of 
estimating equations. In this paper, we show that by using certain influence functions 
with a special construction of estimating equations in the EL ratio, the asymptotic 
distribution of -21ogi?(^) of the functional 9 in (1.1) is a standard x^ without involving 
any unknown scale parameter. Our approach transfers the problem of estimating a"^ to 
the influence functions. As a result, it also significantly simplifies the often intensive 
computations of the EL method for censored data. 

Our work is motivated by the work of Wang & Jing (2001) and Qin & Zhao (2007). 
Wang & Jing (2001) obtained an EL ratio by first finding an estimating equation for 
a certain complete sample and then modifying the estimating equation for the right 
censored sample. The resulting estimating equation is a sum instead of a product 
(inherent of the product limit estimator). With this approach, Wang & Jing (2001) 
use the estimating function Mi(Z, 5, ^) for 9 (see (3.7) ) and Qin & Zhao (2007) use 
M2{Z,5,9) (see (3.8)) for estimating the mean residual life E(F — to|^ > ^o) at age 
to- However, both of these papers obtain an asymptotic scaled x^ distribution for 
-21ogi?(^). 

Instead of Mi and M2, we use infiuence functions. We compute the infiuence functions 

W{Z, 6, 9) of 

l^n= g{x,9)dFn{x), 
Jo 



as defined by (3.6), where F„ is the Kaplan- Meier estimator. The influence functions 
W's are to be utihzed to construct an estimation function for the EL method. Numer- 
ous examples of the function g are given in Section 2. 

The paper is organized as follows. Preliminaries assumptions and examples of 6 and 
g are given in Section 2. The influence function W{Z, 6, 6) are given in Section 3. It is 
shown in Theorem 3.1 that asymptotically ^/n{|J,n—fi) is a partial sum of n independent 
influence functions W{Zj,6j,6) (an IID representation), or is asymptotically linear. 
Here n denotes E g{Y, 9) and the the condition 'Ejg{X, ^) = is not imposed in Section 
3. These results are general for any ^{x) having finite second moment (1.3) and no 
restrictions are placed on the upper boundaries of X and C. 

An IID representation of the Kaplan-Meier estimator has been obtained by many 
authors e.g., Lo & Singh (1986), Stute & Wang (1993) and Chen & Lo (1997) using 
different approaches, under different conditions and in different forms. See Yang (1997) 
and references therein. Here, we use an influence function W{Z,6,9) of 6^ in (1.2) 
obtained in He & Huang(2003). We show that the variance of the influence function 
is smaller than that of Mi and M2 (see Remark 3.1 in Section 3), which results in 
an improvement of the asymptotic coverage accuracies of 6. In Section 4, estimation 
of the influence functions is carried out in Theorem 4.1. The weak convergence of 
— 21og-R(^) to the standard xl distribution without any scale parameter is proved in 
Theorem 4.2 which justifles the EL construction of confldence intervals for censored 
data. In Section 5, simulation comparison of the new method with that of the scaled xl 
distribution is presented. The amount of improvement depends on the form of 6. For 
survival function or mean, the coverage ratios computing from the traditional normal 
approximation and the EL method are about the same. The EL method performs 
better for more complicated 6. Most of the proofs are relegated to the Appendix. 



2 Preliminaries, Assumptions and Examples 

For any right continuous monotone function h{x), let h{x—) or h-{x) denote the left 
continuous version of h{x) and the curly brackets h{x} denote the difference h{x) — 
h{x—). Then h{x} = dh{x). For any cumulative distribution function F, let F = 1 — F. 
Assume that 



F{x) = P(Y < x), G{x) = P{C < x) and H{x) = P{Z < x) 



(2. 1) 



are continuous cumulative distributions of Y, C and Z = min(y, C), respectively. Let 
[0, bn] be the range of H, where 

bfj = sup{a; : H{x) < 1}. 

bp and be are similarly defined for F and G. Then bn = min(6i7, be)- 

Given a sample of n i.i.d. random vectors {Zi,6i), i = 1,2, ■■■ ,n, of {Z,6), their 
empirical distribution functions are given by: 



1 " 

1 " 

1 " 

H4x) = Hl{x) + Hl{x) = -Y.I[Z, < x]. 



(2.2) 



Asymptotic optimal nonparametric estimators of F[x) and G[x) are the well-known 
Kaplan-Meier estimators given by 

s<x 



F^x) = 1 - n 



s<x 



HJs-) 



and Gn{x) = 1 — TT 



HJs- 



(2.3) 



respectively, where an empty product is set equal to one. It can be checked that for all 



X, 



Hn{x) = Fn{x)Gn{x). 



(2.4) 



Applying (2.3) and (2.4) we get 

dF4x) = F4x) - F4x-) = F^x-) "i, (2. 5) 

It follows that 

dHlix) = Gn{x-) dFnix), dH'^ix) = Fn{x-) dG'„(x). (2. 6) 

Put 

H%x) = P{Z < X, (5 = 0), H\x) = P{Z <x,6=l), and (2. 7) 

H{x) = P{Z > x). 

Then 

H\x) = EH^{x) = / F{s)dG{s), 
Jo 

px 

H\x)=EHl{x)= G(s)dF(s), (2.8) 

Jo 

H{x) = EHn{x) = F{x)G{x). 

Here and after, the integral sign J^ stands for J, ^ and J stands for J . 
Examples of 9 and g{x, 9) 
1. g{x^ 9) = I[x > y] — 9 with y fixed. Then 

Eg(Y, 9) = f {I[x >y]-9) dF{x) = 0. 



Solving this equation yields 9 = F{y), the survival function of Y. 

2. g{x, 9) = x^ — 9. Then 9 = EF^, the kth moments of Y. 

3. g{x, 9) = (x — to — 9)I[x > tg] with to fixed. Then 

V o\ - oj P{Y>to) 

the mean residual life of Y. 



4. gix,9)-- 


= x{I[x > y - 


-9). Then 

e= ' r 



sdF(s), 

the length biased survival function of Y . See Vardi (1982), for example. 
5. g{x^ 9) = x"^ — 9x. Then 



1 



EV.o 



oo 



9 = ^- x'dF{x), 



the mean of the length-biased lifetime. 
6. g{x^ 9) = x^ — 29x. Then 



1 f^ 

"=217/ ^'^^W 



the mean of the length biased residual lifetime. 
7. g{x, 9) = I[x < 9] — p with p G (0, 1). Then 9 = F^^{p), the pth quantile of Y. 
Examples (4)- (6) often appear in renewal processes and their applications. 

3 Influence function of /i^ 

Throughout Section 3, ^ is a fixed value. Then it is convenient to suppress 9 in the 
exposition, by setting C,{x) = g{x, 9), fi = E g{X, 9), and 

/xn= fax)dE,{x) = [ ^^dHlix). (3. 1) 

J J Gn{X-) 

Likewise, set W = W{Z, 6) = W{Z, 6, 9). 

In Theorem 3.1, we prove that the estimator /i„ for yU is asymptotic linear. That is, 
there is a function W = W{Z, 6), such that E W^ = 0, Var(W^) < cx) and 



V^ifJ'n - /u) 






The function W{Z, 6) is defined with respect to the true distributions {F, G). Following 
literature, we call W{Zi, 6i) the i-th infiuence function of /i„. See, for e.g. van der Vaart 
(1998) or Tsiatis (2006). 
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Theorem 3.1 will be proved by first establishing a similar result for the truncated 
W{Zi,6i) as defined in (3.3). Let C,b{x) = C,{x)I[x < b] be the restriction of ^(x) on 
(—00,6], where b < bn is an arbitrarily chosen constant. By similar truncation, put 



^ib = / U^) dF{x), Ms) = / U^) dF{x), 6i = I[Yi >a] = l- 6,. (3. 2) 

J Jx>s 

We consider the i.i.d. random variables, 

w^ib) = ^-^-^^, + ^Mz.)- /^,(s)^!^Md/jO(s), (3. 3) 

(^[^i) -n(Aj J H (s) 

for i = 1, ■ ■ ■ ,n. 
Under finite variance condition (1.3), it can be calculated that, 

E^^ = /^., EW,ib) = 0, 



G{s) ^ "' "" J F{s)G\s) 
As b approaches the upper bound 6//, we have 



Var(W^,(6)) = / |^ dF{s) - /ig - / _f'}l]^ ^ dG{s). (3.4) 

J G{s) J F s)G s) 



(3.5) 



fib -^ fi, ip{s) = Mm ipbis) = / ^{x)dF{x), and 



1^,= hm W^,(6) = |^-/.+ =^^(Z,)- /^(s)%^diJ°(s), (3. 6) 
b^bH G{Zi) H{Zi) J h\s) 

for i = 1, ■ ■ ■ , n. The W[s, for i = 1, ■ ■ ■ , n, are i.i.d. random variables. 
Wang & Jing (2001) use the estimating function based on 

M^{Z, 6,9) = 1^-9 (3. 7) 

to estimate 9 in (1.1). Qin & Zhao (2007) used the estimating function based on 

M,(Z,5,^) = 444^, (3.8) 

where g{x, 9) = {x—tQ—9)I[x > to], to estimate the mean residual life 9 = E(Y —tolY > 
to) at a specified age to. This case is covered in our formulation, see example (3) in 



Section 2. Comparing with Mi and M2, our W^s contain two additional terms. Note 
that W^s are not observable random variables and whose estimation will be addressed 
in Section 4. 

Under finite variance condition (1.3), applying the dominated convergence theorem 
and the Lebesgue-Stieltjes integration by parts, we obtain 

E^B^ = fi, EW, = 0, (3. 9) 



Va.T{W,) = I ^dF(s)-/i'- / ^^^2 . dG'(s) (3. 10) 



G{s) ^ ' ^ J F{s)g\s 

F (s)G(s) 

Remark 3.1 Formulas (3.6) are (3.10) are obtained in He & Huang (2003), and 
(3.11) is given in Yang (1994). Under condition (1.3), it can be shown that (3.10) and 
(3.11) are equal. The variance of Wi is smaller than that of Mi and M2 defined in 
(3.7) and (3.8). The variance of the latter two equals 



Iw)''^-'^-'' 



with the corresponding choices of C,{z) = g{z,6), in Mi and M2. 

We proceed to prove Theorem 3.1. For the restricted Wi{b)'s, the following lemma is 
taken from (3.11) of He & Huang (2003). 

Lemma 3.1 Let F and G be continuous. For each fixed, set ^{x) = g{x,6). 
Assume that f ^^{x) dF{x) < 00 and b < bn- Let ^b{x) be the restriction of ^ on (0,6] 
for b < bn. Then, as n ^ 00, 

n fu^) d{F^{x) - F{x)) = ^y] W,{b) + Op{l). 






The following result will be used repeatedly and for easy reference, it is stated in 
Lemma 3.2. Its proof is given the appendix. 
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Lemma 3.2 For b < bn, let {hn{b)} be a random sequence such that hn{b) — )■ h{b) 
in distribution as n ^ oo, and h{b) = Op{l) as b ^ b^j. As n ^ oo, ifVn = Op{l) and 
the random sequence {S'„} can be written as Sn = Op(l) + Vnhn{b) for any b < bn, then 

Sn = Op{l). 

Remark 3.2 In what follows, hn{b) is used as a generic notation to denote any 
random sequence {hn{b)} that satisfies the assumptions of Lemma 3.2. This simplifies 
many of the statements later. For example, under condition (1.3) and b < bn, put 

hn{h)= ^dHl{s). (3.12) 

Jb G (s) 

Then, by the SLLN and (2.8) 



MrnhJb) = lim f " ij^ dHl(s) = [ " ^J^ dH\s) 
^^^Jb G (s) Jb G \s) 

= / =2^G'(s) dF{s) = h{b) ^0, asb^ bn. 
Jb G is') 



lb G (s) 

Theorem 3.1. Let Wi be given by (3.6). Suppose F and G are continuous, and for 
each fixed 9 set ^{x) = g{x, 6). Then under condition (1.3) as n ^ oo, 



n / i{x) d{Fn{x) - F{x)) 



l^Y,W,^Op{\). 



Proof. For b < b^, put ^i, = C,{x) — C,b{x) = C^{x)I[x > b]. Decompose the foUowing 
difference as 

V^ / a^) d{Fn{x) - F{x)) -^y^W, = J,{b) + Ub) - Ub), (3. 13) 



where 



Ji{b) = ^ / U^)d{F,,{x)-F{x)) - y^W,{b), 
J^ih) = ^ jUx)d{FrXx)-F{x)), 



11 



Let 4'h{x) = J^^^^^(s) dF(s). It follows that 

^l{x)<F{xY [ t{s)dF{s). (3. 14) 



•J s>x 

By Lemma 3.1, Ji{b) = Op(l). We shall show that J2ib) = hn{b) and Jsib) = hn(b) 
as n — 7- oo. Applying Corollary 1 of Yang (1994), J2{b) converges weakly to A^(0, al), 
where af is similarly defined as in (3.12) with ^ and ip replacing by their restrictions 
C,fj and ipfj respectively. 

Now 

— 2 ^ / Sbl^ 



at < / ^^ dF{x) ^ 0, as 6 ^ bn- (3. 15) 
J G[x) 

Therefore J2(&) converges to h{b) = ZqUi, in distribution, and h{h) = Op(l) as 6 — )■ bff, 
where Zq is a A^(0, 1) random variable. It follows that J2{b) = hn{b). 

To prove J^ib) = hn{b), note that the difference Wi — Wi{b), as given in (3.6) and 
(3.3) equals to 

^'^^^^^^ ''U^)dF{x) + =^UZ,) - fMs)^-^^^dH\s). (3. 16) 



GiZ,) J - ' - H{Z,)'"' - J ■"- H\s) 

Therefore Wi — Wi{b) are i.i.d. random variables with mean zero and variance o"^. 
Hence, Jz{b) = hn{b) follows for the same reason as that of J2{b). 
We conclude that the following holds for (3.13), 

r 1 " 

V^ / a^) d{F4x) - F{X)) -—J2W, = Op(l) + hnib) + hnib). 
^ ^^ 1=1 

Theorem 3.1 follows from Lemma 3.2. D 

Remark 3.3 If bp < be, then (1.3) is equivalent to ^ having finite second moment. 
If ^ is bounded and away from zero, then (1-3) is equivalent to /g°° dF{s)/G{s). 
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4 Empirical Likelihood Ratios and Confidence In- 
tervals for 9 

To develop an EL inference procedure, we consider a specific g{x, 9). For each fixed 9, 
as before, set ^{x) = g{x,9). We shall utilize the i.i.d. random variables 



W. = W^ + ^fTT^i^iZ,) - Hs)-^=^dH%s). (4. 1) 



G{Z,) HiZ^^ '' J ^^ ' h\s) 
to obtain an estimating equation for the EL ratio. Recall that fi = EWi = J^ C,{x) dF{x) 
and Vai{Wi) are given by (3.10). Note that setting C,{x) = g{x,9) above has nothing 
to do with defining 9 from the equation Eg{X,9) = as given in (1.5). If, however, 
the true parameter 9q is the solution of the equation 

Eg{Y,9) = lg{x,9)dF{x) = 0, (4. 2) 

then ^{x) = g{x, 9q) is such that 

POD 

H = EWi= / ^{x)dF{x) = 0. 
Jo 

Regarding Wi for i = 1, ■ ■ ■ ,n as a "complete" random sample, one could formulate 
an EL likelihood ratio R{9q) with multinomial probability Pi assigned to Wi and the 
constraint Y17=i ^iPi = 0- However, W^s are not observable because of the unknown 
distributions G, F H and H^. We shall replace them by the KM estimates, F„, G„ 
given by (2.3) and an estimate of ip, 

M^)= [ e(s)dF„(s). (4. 3) 

J s>x 

Replacing G, H, H^ in (4.1) by their corresponding empirical distributions in (2.2) gives 
an approximation of Wi in (4.1) by 



Wm = #=T\ + ^rr^MZi) - / Ms)-^^^^dH'^is). (4. 4) 



G^{Z,-) i/„(Z,-)^"^ '^ J ^"^ Hlis- , 
The price to pay for the estimation is that Wn/s are not stochastically independent 
which complicates the ensuing analysis. 
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The following theorem indicates the possibility of using Wni to construct empirical 
likelihood ratio and to obtain asymptotically a standard x^ distribution. 

Theorem 4.1. Let Wni be given by (4-4) (^f^d E^{Y) = 0. Then under condition (1.3), 
as n ^ oo, we have 



- /r ^ It 

Proof. By (2.6), we have 

n 



(4.5) 



ni 

n ; 
1=1 



n g (5:(ZH + 5:(^'''"'^-' - i "'"'■''^^'^"<^' 



J Gnis-) J Hn{s-) J HJs-) 



'-nK^—) J Has- 

Jjs) 

Gn{s — 

Applying J C,{s) dF{s) = EC,{Y) = and Theorem 3.1, we arrive at 

-^TWm = V^ as) d(F„(s) - F{s)) = V W^. + 0,(1). (4. 6) 

D 

Following Owen (2001), define the EL ratio of ^ by a multinomial likelihood subject 
to constraints as 

{n n n "^ 

l[np,\ ^p, = 1, 5^p,W^„, = 0,p, >0, z = l,2,---,nl. (4. 7) 

To determine -R(^), we solve, as usual, for the Lagrange multipliers /i and A in 



n 



A = ^log(npi) - nX{^piWni) - fi{l -^Pi)- 

i=l 1=1 j=l 

Then /x = —n and pj = — (1 + AW^„j)^^, i = 1, 2, ■ ■ ■ , n, where A is the solution of 

n 



-E- 



t=l 
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The uniqueness of A will be addressed in the proof of Theorem 4.2. The EL ratio of 
6 can be written as 

n n 

R{e) = l[{np,) = J](l + XWm)-\ (4. 8) 

4 = 1 j=l 

Theorem 4.2. Suppose that 9q is the unique solution of (4-2) and finite second moment 
(1.3) holds. Set ^{x) = g{x,6o). Then 1{9q) = — 21og-R(^o) converges in distribution to 
a Xi random variable with one degree of freedom, as n — )> oo. 

Applying Theorem 4.2, confidence intervals for 6 can be constructed as 

h = {9: m < ci_ J, (4. 9) 

where Ci_q, is the (1 — a)th quantile of the x^ distribution. Ji has asymptotic coverage 
probability of 1 — a, as n — )■ oo. 

To prove Theorem 4.2, we shall make use of the following Taylor's expansion of 
-21ogi?(^o), 

n 

-21ogi?(^o) = 2^1n(l + A„W„,) 

i=l 
n .. 

= ^2(a„1^„,--A21^„U^.) ( here Ir^.l < |X,|3) 

i=l 

n 

= 2Xr,nW^-XlnSl + 2J2v^ 



i=l 



-2 



= 2n^ - -^nWlia' + o,il)) + o,il) 

nW„ ... 2 .■,..,. 

= — - — h Op(l) — )■ Xi) 111 distribution, 

where 

n 

Wn = Y. W'"- and si = E:Li W^,. (4. 10) 

j=i 

Asymptotic analysis of Wn = X]r=i ^™ ^^^ "^n — Yll=i ^ni ^^^ needed for establishing 
the last two equalities in the expansion. It will be proven in Lemma 4.3 and Theorem 
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4.2 that both of these averages are related to the asymptotic variance (1.4) or (3.10). 
The following lemmas are needed for proving Theorem 4.2. 

Lemma 4.1 Let fn{x) and f{x) be monotone functions defined on the range of Z . If 
f{x) is continuous and for x G [0,6/^], fn{x) — )■ f{x), as n -^ oo, then fn{x) converges 
to f{x) uniformly on [0,6//]. The proof is omitted. 

Lemma 4.2 LetVni = WniHn{.Zi—)H{Zi) andVi = WiH (Zi). Under the conditions 

of Theorem 4-2 and 9 = Oq, as n -^ oo, 
1 " 

(1) -j:iWm-w,y = o,{i), 

ni=i 

1 " 
(2) -E(K,-V^,)'^0, a.s.. 
n j=i 

The proof is relegated to the Appendix. Lemma 4.2 is needed for showing that 
with probability 1, for large n the set {W„i} contains a positive and a negative value. 
To facilitate the proof, V^^s, a modification of W^^s, are introduced to deal with the 
problem at the boundary hu- It follows that for large n there exists a unique A„ for 
R{e) in (4.8). 

Lemma 4.3 Let Wni and Wi he given by (4-4) '^''^d (4-1), respectively. Under the 
conditions of Theorem 4-2 and 6 = 6q, as n ^ oo, 

1 " 
max \Wni\ = Op{y/n), - ^^ W^^i = o"^ + Op(l), 

l<i<n n •'— ' 



n . 
1=1 



and 



1 " 
—= ^ Wni -^ A^(0, (T^), in dist., 



n 

i=l 



where a"^ = Var{Wi) is given by (3.10). 
The proof is relegated to the Appendix. We now prove Theorem 4.2. 



Proof of Theorem 4-2. The Lagrange multiplier A in (4.6) appears in the equation 

1 - 



MAJ^Eiq^F-^O. (4.U) 
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We shall show that for large n, h{\) = has a unique solution A„ such that XnWni > —1 
for all i. Put 

\oO, Wni = 0. 

Let ?7(i) < f/(2) < ■ ■ ■ < ^(n) be the ordered statistics of f/i, U2, ■ ■ ■ , ?7n- Then 

n 



h(x) = 1 V ^"^ = i V — 



. , - 'ni n— A- Uli\ 

1=1 1=1 ^ ' 

is monotone and differentiable in A on each nonempty interval (f/(j), f/(i+i)). We claim 
that for large n, there exists an i such that f/(j) < < f/(j+i) • To see this, we note that 
for every e > 0, by Lemma 4.2(2) 

^ n 1 " 

- V/[ IK. - V.I > ^] < - E(^- - ^^)V^' = «(!)> a.s.. 
Using the fact that I[Vi > e] < I[Vni > e/2] + I[\Vni - Vi\ > e/2], we get 
- V/IK. > e/2] >-y^I[V, > 5] + 0(1), a.s.. 

i=l i=l 

Using the fact that P{Vi > 0) = P{Wi > 0) > 0, it is seen that for some e > 0, 

1 " 
liminf-V /[Ki > e/2] > P{Vi > e) > 0, a.s.. 

ra— )-oo 77, •^— ' 
i=l 

Similarly, we have 

1 " 
liminf - V/[Ki < -£/2] > P(Vi < -e) > 0, a.s.. 



n— ^-oo n 

i=l 



Since l^j and Vni have the same sign, hence the claim is true. It follows that there 
is a unique A„ G (f/(j), f/(i+i)) = (— l/max{lV„j}, — l/min{W^„j}) such that /i(A„) = 
and A„max{W„j} > —1 and A„min{iy„j} > —1. 
The rest of the proof is similar to that of Owen (2001). In fact, setting 

-t n 1 "■ 

i=l i=l 
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we have S"^ = a"^ + oJ\ 



■py^-}^ 



itrVxrlt '-^^ - ^« - ^'MK) ^ KW, 



i=l ' j=l 

and 






i=l i=l 

It follows that 



and 



o-^ + Op(l) 



Applying Lemma 4.3, we have 

n n 

V |X,|3 < A^ V |W^™|2 max \Wm\ = Op{n-^'^)Op{n)op{n^'^) = Op(l). 






Therefore the Taylor expansion (above (4.10)) is valid from which the theorem follows. 

D 

Remark 4.1 We are able to obtain the standard asymptotic xf distribution for 
— 21ogi?(^o) is because the asymptotic variance of 

-. n -. n 

(which is 0"^ = Var{Wi)) equals the limit of (see Lemma 4-3) 

If —2\og{EL ratio) is based on the estimating function M2 = M2{Z,6,9) in (3.8) ( 

or in (3. 7)), then 

.r _ 9{Zi,0)6i 

* n.i 



1 - Gn{Zi) 
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will be used to construct —2 log{EL ratio) . Now, the asymptotic variance of 

-. n ^ n 



t=l * 4=1 



is 0-2 (see(3.10)), hut the limit of \ YTi=i K 



or 



2 



n ^-^ \ n ^-^ 

1=1 j=i 

is (see Remark 3.1) 

J G{s) 
Therefore, a scaled parameter r = crf/cr'^ must be introduced in order to obtain the 

asymptotic distribution for —2log{EL ratio) as in Wang & Jing (2001). 

5 Simulation 

Simulations are carried out to study and compare finite sample performance of confi- 
dence intervals Ji in (4.9) derived from Theorem 4.2 and I2 from the scaled Xi distri- 
bution given by Wang & Jing (2001) and Qin & Zhao (2007). 
To calculate Ji, Wni in (4.4) is used, where 

'^'" - 54^ + K(5h *'* ■' " U Hi(z,-) • 

and ipn{x) is given by (4.3). 

Confidence intervals I2 are calculated as follows. Let F„, and G*„ be the Kaplan-Meier 
estimators defined by (2.3). Suppose 9 is the unique solution of f g{s, 6) dF„(s) = 0. 
Set 

^i=g{Zi,e), ii = g{Zi 



■'D '^ }■) 






-. n 1 " 



n '' — ' n 

i=\ i=\ 
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nVar (jack) 
- — -* 
where n Var (jack) is the modified jackknife estimator of the asymptotic variance of ^ 

given in Stute(1996). Then, the EL-based confidence interval for 6 is 

I^=h: 2f ^log(l + \Vm) < ci„« \ , (5. 2) 

where A is the solution of XliLi K.j/(1 + A\4i) = 0. 

Simulations were performed in two scenarios. In scenario I, the parameter of interest 
is ^0 = E y and in scenario II, the mean residual lifetime. 

Scenario I: The parameter of interest is ^o = E F and ^{x) = g{x,6) = x — 6 is used 
for calculating Ji. Two cases were simulated: 

(i) The lifetime Y is uniformly distributed on (0, 1) and the censoring time C is 
uniformly distributed on (0, c). We selected c = 2.5 and c = 1.3 which corresponds 
respectively to 20% and 30% censoring proportions. 

(ii) Y has a Weibull(l, 10) distribution and C has an Exp(A) distribution. Then for 
A = 4.3 and A = 2.7, the corresponding censoring proportions are 20% and 30%. The 
simulated observations are n i.i.d. copies oi Z = min(y, C),6 = I[Y < C]. Based on 
the simulated observations, confidence intervals Ji derived from Theorem 4.2 and I2 
from (5.2) were calculated. The process was repeated for N = 2 x 10^ times and the 
coverage proportions and the average width of the confidence intervals were calculated 
using the A^ data sets. The results are summarized in Table 1 and Table 2. 

The following are noted. 

(1) As the sample size n increases, all of the coverage proportions converge to the 
nominal level 1 — a. 

(2) For Uniform(0, 1) distribution, Ji has better coverage proportions. In 8/16 of the 
cases, the average width of I2 is slightly shorter than that of Ji. In 8/16 of the cases, 
I2 and Ji have the same average width. 
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Table 1: The coverage proportions for the true 6q = EY 



20% censoring proportion 


Nominal Value 
l-a 


Sample Size 
n 


Uniform(0, 1) 
h h 


Weibull(l, 10) 
h h 


0.90 


20 


0.876 


0.881 


0.871 


0.871 




40 


0.895 


0.897 


0.889 


0.890 




60 


0.897 


0.897 


0.893 


0.893 




80 


0.897 


0.898 


0.896 


0.896 


0.95 


20 


0.928 


0.935 


0.922 


0.924 




40 


0.946 


0.949 


0.939 


0.941 




60 


0.947 


0.948 


0.945 


0.946 




80 


0.947 


0.947 


0.947 


0.948 


30% censoring ] 


proportion 










Nominal Value 

l-a 


Sample Size 
n 


Uniform(0, 1) 
h h 


Weibull(l, 10) 
h h 


0.90 


20 


0.841 


0.861 


0.867 


0.869 




40 


0.885 


0.890 


0.890 


0.891 




60 


0.888 


0.892 


0.890 


0.891 




80 


0.897 


0.900 


0.893 


0.894 


0.95 


20 


0.897 


0.916 


0.916 


0.924 




40 


0.934 


0.941 


0.939 


0.943 




60 


0.941 


0.946 


0.944 


0.946 




80 


0.945 


0.947 


0.945 


0.947 
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Table 2: The average width of confidence intervals for 6q = EY 



20% censoring proportion 


wii 


dth 


wi 


dth 


Nominal Value 
l-a 


Sample Size 
n 


Uniform(0, 1) 
h h 


Weibull(l, 10) 
h h 


0.90 


20 


0.217 


0.218 


0.092 


0.091 




40 


0.157 


0.157 


0.066 


0.065 




60 


0.129 


0.129 


0.054 


0.053 




80 


0.112 


0.112 


0.046 


0.046 


0.95 


20 


0.258 


0.259 


0.110 


0.109 




40 


0.187 


0.187 


0.079 


0.078 




60 


0.154 


0.154 


0.064 


0.064 




80 


0.133 


0.133 


0.056 


0.055 


30% censoring ] 


proportion 


wii 


dth 


width 


Nominal Value 

l-a 


Sample Size 
n 


Uniform(0, 1) 
h h 


Weibull(l, 10) 
h h 


0.90 


20 


0.220 


0.227 


0.097 


0.096 




40 


0.162 


0.164 


0.069 


0.069 




60 


0.134 


0.134 


0.057 


0.057 




80 


0.116 


0.116 


0.049 


0.049 


0.95 


20 


0.260 


0.270 


0.116 


0.116 




40 


0.192 


0.196 


0.083 


0.083 




60 


0.159 


0.160 


0.068 


0.068 




80 


0.138 


0.139 


0.059 


0.059 
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Table 3: The sample variances of {W^ni} and {Vm}, ^o = E^ 



20% censoring proportion 



Uniform(0, 1) 



Weibull(l, 10) 



Sample Size n 



'w 



^w 



'y 



20 


0.0935 


0.1121 


0.0157 


0.0163 


40 


0.0938 


0.1115 


0.0157 


0.0162 


60 


0.0937 


0.1107 


0.0157 


0.0161 


80 


0.0934 


0.1100 


0.0158 


0.0162 



30% censoring proportion 



Uniform (0,1) 



Weibull(l, 10) 



Sample Size n 



'W 



s2 



'VK 



Sy 



20 


0.1005 


0.1386 


0.0175 


0.0185 


40 


0.1013 


0.1401 


0.0176 


0.0184 


60 


0.1016 


0.1402 


0.0176 


0.0183 


80 


0.1012 


0.1393 


0.0176 


0.0183 



(3) For Weibull(l, 10) distribution, Ji has better coverage proportion and width. 

In the jth simulation, {VTni} and {Vni} were calculated according to (4.4) and (5.1) 
respectively. Then the sample means of {W^ni} and {Vni} are the same (see the proof 
of Theorem 4.1). But the sample variance s^(j) of {W^m} and the sample variance 
Sy{j) of {Vni} are different. Let 

N _ N 



^W 



j;^J2sl,{j), 4 = -^4(j) 



i=i i=i 

Table 3 shows that the sample variance of {VTnj} is smaller than that of {Kij}. This is 
proved in Remark 3.1 for the population variances. 

Scenario II: Let g{x,6) = {x — t^ — 6)I[x > to]. Then by solving the equation 
Eg{Y, 6) = 0, we obtain the mean residual life of Y, 

E{Y-to)I[Y>to] 



eo = E{Y-to\Y>t, 



(5.3) 



P{Y > to) ' 

as studied in Qin & Zhao (2007). Let Y have a Weibull (1, 10) distribution and C 
have an Exp(A) distribution. By setting A = 4.3 and A = 2.7, we achieved 20% and 
30% censoring proportions respectively. As in Scenario I, each simulation was repeated 
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Table 4: The coverage proportion and average width of confidence intervals for 
6q = E(Y — to|F > to) under the assumptions of F ~ Weibull(l, 10), 20% censor- 
ing proportion, and 1 — a = 0.90. 



Sample Size n 


Method 




Coverage Ratio 






Average 


3 Width 








0.90 


P{Y 
0.70 


>to) 
0.50 


0.30 


0.90 


P{Y 
0.70 


>to) 
0.50 


0.30 


20 


h 
h 


0.878 
0.881 


0.851 
0.863 


0.795 
0.820 


0.659 
0.701 


0.074 
0.074 


0.062 
0.062 


0.054 
0.056 


0.044 
0.048 


40 


h 
h 


0.889 
0.891 


0.878 
0.884 


0.859 
0.874 


0.800 
0.833 


0.053 
0.053 


0.046 
0.046 


0.042 
0.043 


0.039 
0.041 


60 


h 
h 


0.897 
0.898 


0.892 
0.897 


0.877 
0.888 


0.839 
0.863 


0.044 
0.044 


0.037 
0.038 


0.035 
0.035 


0.034 
0.035 


80 


h 
h 


0.895 
0.896 


0.888 
0.892 


0.884 
0.892 


0.853 
0.871 


0.038 
0.038 


0.033 
0.033 


0.031 
0.031 


0.030 
0.031 



N = 2x 10^ times. The coverage proportion of the N data sets and their average width 
were calculated. The results are summarized in Table 4 and Table 5, respectively. 
The following are noted from Tables 4 and 5. 

(1) As the sample size n increases, all of the coverage proportions increase and are 
close to the nominal levels. 

(2) The coverage proportions of Ji are much better than that of 12- 

(3) In 15/32 of the cases, the average width of I2 is slightly shorter than that of Ji. 
In 17/32 of the cases, I2 and Ji have the same average width. 

6 Appendix 

Proof of Lemma 3.2. Put rjn = 0^(1). By assumptions, for any 5 > and 6 > 0, 
there exist M > 0, b < bn and no > 1 such that for n > no, -P(|Ki| > M) < 6, 
P{\hn{b)\ > e/M) < P{\h{h)\ > e/M) + 5/2<5 and P(|r7„| > e) < 5 . It follows that 
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Table 5: The coverage ratio and average width of confidence intervals for ^o = E(^ ~ 
^o|^ > ^o), under the assumptions Y ~ Weibull (1, 10), 30% censoring proportion and 
I- a = 0.90. 



Sample Size n 


Method 




Coverag 


;e Ratio 






Average 


i Width 








0.90 


P{Y 
0.70 


>to) 
0.50 


0.30 


0.90 


P{Y 
0.70 


>to) 
0.50 


0.30 


20 


h 


0.864 
0.872 


0.833 
0.851 


0.760 
0.793 


0.605 
0.659 


0.079 
0.079 


0.065 
0.065 


0.055 
0.058 


0.043 
0.048 


40 


h 
h 


0.887 
0.891 


0.872 
0.882 


0.846 
0.867 


0.777 
0.818 


0.057 
0.057 


0.048 
0.049 


0.045 
0.046 


0.041 
0.043 


60 


h 


0.892 
0.895 


0.888 
0.895 


0.870 
0.884 


0.822 
0.851 


0.046 
0.046 


0.040 
0.040 


0.037 
0.038 


0.036 
0.037 


80 


h 
h 


0.892 
0.895 


0.888 
0.895 


0.878 
0.887 


0.845 
0.869 


0.040 
0.040 


0.035 
0.035 


0.033 
0.033 


0.032 
0.033 



for n > no, 



P{\Sn\>2e) < P{\vn\>e) + P{\Vnhn{b)\>e) 

< 6 + P{\VnK{b)\>e,\K{b)\<e/M) + 6 

< P{\Vn\ >M)+26 

< 36. 
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Proof of Lemma J^.2. The differences Wni — Wi in eq. (4.4) and (4.1) can be expressed 
in terms of 

7i = ^ 



Gn{Zi — ) G{Zi) 

J H^is-) J His) 



as 



Wni -Wi = 'ji + r]i-Ui 
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Applying an elementary inequality (a + 6 + c)^ < 3(a^ + 6^ + c^), we obtain 

(W^n. - W,f = (7. + V^- ^0' < 3(7- + V- + ^- )• (6. 1) 

The lemma will be proven by showing that the sample means of 'yf,f]f and uf tend 
to zero in probability. The proofs will be presented in (A), (B) and (C) below. 

(A) The sample mean of 7^ is Op(l). 

Proof: Let Gn{x) be the K-M estimator defined in (2.3) and b < Bh- Then as n — )> 00, 

\G^{s-) - G{s)\ IGr^js-) - Gis)\ 

Un = sup ^-— = Op(l), Vn = sup ^- ^ = Op(l). (6. 2) 

s<b Gn[S) s<max{Zi} <-T„(S-j 

See Zhou (1992). To apply this result, we shall in the following proof split the integrals 
into two intervals [0, b] and (6, bn] accordingly. 
For any b < bn, using (2.2), we have 



«W ^W^'dff'M 



1 " 

-Ti' = 

Jo G (s) Jb G (s) 

= Op{l)Op{l) + Op{l)hn{b) = Op(l), as n ^ oo, (6. 3) 

where U^ and V^ are given by (6.2) and 

rbn ^2/ \ 

K{b)= ^dHlis). (6.4) 

Jb G (s) 

Recall that hn{b) is explained in (3.12). It was shown that hn{b) satisfies the conditions 
in Lemma 3.2. The proof follows by invoking Lemma 3.2. 



(B) The sample mean of rjf is Op(l). 






Proof: For b < bn, define 






Tnib,t]= r^^d/j°(.), 

Jb H^{s-) 


Snib^t]-- 


Jb h\s) 



dH2(s). 



26 



Observe that 



^n(s) = ( / e(«) dF^y < F„(s-) f eiu) dFM 

J u>s J u>s 



and Fn and G„ have no common jumps. It follows that 

is-] 



TnibM < ["([ e{u)dFM)Fl{s~)^ 
< ['"([ e{n)dFM)d(=^^ 



< hm ^— / eiu)dFM+ 4^dF,,is) 



^^bn Gnis) Js Jb G'„(s , 

< 2f"^j^dF^{s) 
Jb Ct„(s-) 

Jb ^Gn{s-} G{s} G{s}y 
= Op{l) + Op{l)hn{b). (6. 5) 

The first inequality follows from (2.6) the second and the third from integration by 
parts, the fifth from (2.5) and the last equality from (a + b)^ < 2(a^ + 6^), (6.3) and 
(6.4). 

By the same token, we conclude that Sn{b,bH] = Op{l) + Op(l)/i„(6). 

Write 

i = e-c.M^)= I t{s)dFn{s)- j r(s)dF„(s), 

J s>x J s>x 

where ^'^ and ^~ are the positive and negative part of ^. Define monotone functions: 



^ni^) = / e^(s) dF„(s), ^^{x) = / ^Hs) dF{s). 

J s>x J s>x 

'ipni^) converges to ip'^i^x) almost surely for x G [0,6^;/] as shown by Stute & Wang 
(1993). Furthermore, by Lemma 4.1, the convergence is uniform on [0,6^]- 
From these we conclude the uniform convergence of ipn to ip, 

sup lipnix) — ipi^x)] = o{l), a.s.. (6. 6) 

0<x<bH 
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Therefore, for b < bn, 



sup ^ — - - ^^- -)■ 0, a.s.. 



Applying (a + 6)^ < 2a^ + 26^ and Lemma 3.2, we have 



n^'' Jo A ^HJs-) His)) "^ ^ 



= Op{l) + 0^{l)K{b)+Op{l)K{b) = Op{l). (6.7) 

(C) The sample mean of vf is Op(l). 
Proof: Write, for < a < t, 

i){s)dH^{s) 



Ja H^AS — ) Ja H is) 



Then, for b < bn, v/e have 



A^, = [" Blib, t] dH4t) = [ " Blib, t] d(-/J„(t)) 

Jb Jb 

< Hnih)Blib,b] + 2 r H^it-)Bnib,t]^^<lHlit) 

Jb Hjt-) 



< + 2( f" BlibMHn{t))"'\ r Sj^dH^it)^'^" 
^Jb ^ ^Jb Hjt-) ' 

= A„2[T46,6h)]'/'. 



Kit- 

Hl{t-) 



The second term in the first inequality is obtained using the Lebesgue-Stieltjes inte- 
gration by parts. 
Applying (6.5), we get 

r Bl{b,t\dHn{t) = ^l< 4T„(6,6h] = Op(l) + 0p(l)/i„(6). 
Jh 

Similarly, for Sip, bu] = ES'„(6, bu], we have 

"bH 

D\b, t] dH^it) < AS{b, bn] + Op(l) = Op(l) + Op{l)K{b). 
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Applying the uniform convergence of ipn to ip (see(6.6)), we conclude that for any 
b < bff, with probability 1, -B„(0,t] — )> D{0,t] uniformly on [0,6]. It follows that 
5„(0,6] ^ D{0,b], and 

-E^^ = [ + [" {B40,t]-DiO,t]Y dHr.it) 
n .^^ Jo Jb 

= Op(l)+ [" {Br,{0,b]-D{0,b] + Bn{b,t]-D{b,t]YdHn{t) 
Jb 



2/1, j-1 J u /x\ I A / n2/ 



< Op(l) + 4 / 5;t(^ ^] di^n(t) + 4 / D^(6, t) diJ„(t) 

= Op(l) + Opil)hnib) + Opil)hnib) = Op(l). 

Now, we prove result (2) of the lemma. Introduce Ani = WiHn{Zi—)H{Zi). From 
(6.1) we get 

(K. - A„,)2 < 3(7,' + T]^ + uf)Hl{Z,-)H\z,). 

Similar to (6.3) and (6.7), we have 



1 _ _ 

-J2i^hI{z.-)h\z,) 

i=l 

r (J^, - ^yHl{s-)H\s) dH'^is) = o(l), a.s., 
Jo ^Gn{s-) G{s)y 

-J2^^hI{z.-)h\z,) 



n . 



' HJs-)H {s]dH„{s] = o 1 , a.s.. 



.0 ^H^{s-) H{s) 
Since for any b < bn, with probability 1, 

\BniO,t]-D{0,t]\HrXt-)Hit)^0 

uniformly on [0,6], and 



sup|5„(0,t]-D(0,t]|/J„(t-)if(t) 

t<bH 
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is bounded by some constant, it follows that 

-Y.yfHl{Z,-)H\z,) 
1=1 

= r [(5„(0,t] - D(0,t])F„(t-):^(t)]'diJ„(t) = o(l), a.s.. 
Jo 

Now we get 

- Y.^Vm - A^^f < 3- ^(7f + V^ + uf)Hl{Z,-)H\z,) = o(l), a.s.. 

i=l i=l 

At last, we have 

n ^ — ^ n ^ — ^ n ^ — ' 

j=i 1=1 j=i 

= oil) + -y2iHiZ,)-Hr,iZ,-)yW^H\z,) 

%=\ 
= o(l), a.s.. 

D 

Proof of Lemma 4-3. Since Wi are i.i.d. random variables with zero mean and finite 
variance a^, hence max |VFj| = Op{y/n). It follows from Lemma 4.2 that 

l<j<n 

max |W^„j| < max |lV„j - 11/j| ) + max |W^j| 

l<.i<n \ l<i<n / l<i<n 

1 ^ 1 /9 

i=l 

= OpiV^). (6. 8) 

Note that W^^ is bounded by 

W^ + {W, - Wmf - 2\W,{W, - Wni)\ < Wl <Wf + {Wm " W,f + 2\W,{W, - Wni)\. 

By Lemma 4.2, we get 



1 

n— s-co 7^ •^— ^ 
j=l 

The last result follows from Theorem 3.1, S,{x) = g{x,6Q) and (4.6). 
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